Global approaches for finding small RNA and small open reading frame functions.

نویسندگان

  • Karen M Wassarman
  • Patricia J Kiley
چکیده

Sequencing of the first Escherichia coli (K-12) genome revealed that there were 4,300 open reading frames (ORFs) expected to encode proteins and many stable RNA genes that encode rRNA and tRNA (3). Surprisingly, at that time only 43% of the ORFs had been previously described, and 38% could not be assigned even predicted functions. These observations suggested that there was a wealth of information still awaiting discovery even in this extremely well-studied model organism, which spurred great interest and focused studies on the ORFs of unknown function. Over the past decade it also has become increasingly apparent that there is an abundance of additional genes that play important roles in cellular physiology; these genes encode small RNAs (sRNAs) and small proteins (small ORFs [sORFs], defined as 50 amino acids) that were overlooked in the initial annotations due to their small size and/or lack of open reading frames (8, 16). Most of the sRNA and sORF genes are currently of unknown function; however, sRNAs and sORFs of known function often have regulatory roles, frequently in signal transduction pathways and in coordinating regulatory networks (6, 8, 16). Therefore, there is great interest in these gene classes and the identification of their functions, and two papers in this issue of the Journal of Bacteriology, by Hobbs et al. (9) and Hemm et al. (7), present new genome-based approaches to search for functions of sRNAs and sORFs. Although directed genetic and genomic approaches (1, 2) have been quite successful in identifying functions for proteincoding genes and in reducing the number of ORFs of unknown function (10), these approaches have been less profitable for identifying sRNA and sORF gene functions. Loss of function of regulatory sRNA-encoding genes typically leads to more subtle phenotypes than commonly observed when regulatory proteins are mutated, perhaps due to the observation that sRNA regulation is usually more modulatory in nature than that of their protein counterparts that commonly direct largescale changes (16). As a consequence, phenotypes associated with mutations in sRNA genes can be difficult to recognize without additional information pointing toward a specific time, event, or condition to explore. In addition, sRNA and sORF genes are significantly smaller targets for traditional genetic mutation, and sRNA genes are not subject to frameshift or nonsense mutations (since they do not code for proteins), making it more difficult to generate loss-of-function alleles by general mutation. Therefore, it is perhaps not surprising that sRNA and sORF genes have not been identified readily in classical genetic screens or selections. More global approaches, such as mRNA expression profiling and proteomic studies, also have been extremely powerful for characterization of many ORFs and have led to identification of many cellular functions. However, once again such approaches have not been as successful for studies of sRNAs and sORFs, due in part to the fact that most commercially available microarrays do not include representation of these genes, as they have only been recently identified, and that general protocols used for proteomic studies, including mass spectrometry or two-dimensional gel electrophoresis, are not well suited for studying small proteins. The two highlighted papers from Gisela Storz’s group, the reports of Hobbs et al. (9) and Hemm et al. (7), tackle the problem of characterization and identification of function for sRNAs and sORFs, respectively, and present their progress in large-scale directed studies especially suited to the study of these genes. Importantly, the approaches described in these papers are also generally applicable to functional studies of any unknown gene in E. coli or other microorganisms. In the first paper by Hobbs et al. (9), the authors demonstrate the power of analyzing DNA bar-coded mutants in mixed population studies by uncovering mutant phenotypes associated with several small RNAs and small proteins. For these experiments, strains were generated in which genes of interest were individually replaced by bar codes that were originally designed for construction of mutant libraries in yeast (5, 11). The bar code contains two 20-mer sequences (UP and DN) that are unique for each mutation generated. Flanking the specific UP and DN sequences are three additional sequence elements (Fig. 1, common sequences com1, -2, and -3) that are identical for all bar codes. These “common” sequences serve as priming sites for PCR amplification of the UP or DN bar codes from each strain (Fig. 1, arrows 1a and 1b for UP and 2a and 2b for DN). Thus, isolation of genomic DNA from a mixed population of mutant cells allows the amplification of the UP and DN barcodes using just two sets of primers (Fig. 1). The relative abundance of each bar code within the PCR products should be representative of the relative abundance of each strain within the population. In this way, each mutant can be independently tracked, even in a culture containing many different genotypes, by hybridization of amplified bar code DNA to a commercially available DNA array designed to detect the bar codes. Note that each bar code has two sequence elements (UP and DN) that can be analyzed independently. The bar-coded mutant approach clearly facilitates analysis of a large number of mutants at once, providing quantitative representation of each strain present. However, in addition, it * Corresponding author. Mailing address: Department of Biomolecular Chemistry, 1300 University Avenue, University of Wisconsin— Madison, Madison, WI 53706. Phone: (608) 262-6632. Fax: (608) 2625253. E-mail: [email protected]. Published ahead of print on 23 October 2009.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

شناسایی RNA های غیرکدکننده کوتاه ‌عملکردی با استفاده از روش های بیوانفورماتیکی در گوسفند و بز

MicroRNAs (miRNAs) are small non-coding RNAs that have functional roles in post-transcriptional modification. They regulate gene expression by an RNA interfering pathway through cleavage or inhibition of the translation of target mRNA. Numerous miRNAs have been described for their important functions in developmental processes in numerous animals, but there is limited information about sheep an...

متن کامل

The nucleotide sequence and gene organization of red

Red clover necrotic mosaic virus, a member of the dianthovirus group, is characterized by a genome composed of two nonhomologous single-stranded FNAs of approximately 4.0 (RNA-1) and 1.4 kb (RNA-2). The complete nucleotide sequence of the RNA-2 has been determined. RNA-2 i s 1448 nucleotides in length with a 5 terminal m7G cap and no 3 terminal poly-A t a i l or 5 terminal VPg. An open reading ...

متن کامل

Incredible RNA: Dual Functions of Coding and Noncoding

Since the RNA world hypothesis was proposed, a large number of regulatory noncoding RNAs (ncRNAs) have been identified in many species, ranging from microorganisms to mammals. During the characterization of these newly discovered RNAs, RNAs having both coding and noncoding functions were discovered, and these were considered bifunctional RNAs. The recent use of computational and high-throughput...

متن کامل

The nucleotide sequence and gene organization of red clover necrotic mosaic virus RNA-2.

Red clover necrotic mosaic virus, a member of the dianthovirus group, is characterized by a genome composed of two nonhomologous single-stranded RNAs of approximately 4.0 (RNA-1) and 1.4 kb (RNA-2). The complete nucleotide sequence of the RNA-2 has been determined. RNA-2 is 1448 nucleotides in length with a 5' terminal m7G cap and no 3' terminal poly-A tail or 5' terminal VPg. An open reading f...

متن کامل

Specific residues at every third position of siRNA shape its efficient RNAi activity

Small interfering RNA (siRNA) induces sequence-specific post-transcriptional gene silencing in mammalian cells. Different efficacy of each siRNA is considered to result from sequence preference by protein components in RNAi. To obtain mechanistic insight into siRNA functionality, here we describe a complete data set of siRNA activities targeting all possible position of a single mRNA in human c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of bacteriology

دوره 192 1  شماره 

صفحات  -

تاریخ انتشار 2010